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Abstract 

^ DNA based nanostructures built on a long single stranded DNA scaffold, known as DNA origamis, 

offer the possibility to organize various molecules at the nanometer scale in one pot experiments. 

<N 

,i_> The folding of the scaffold is guaranteed by the presence of short, single stranded DNA sequences 

O 

(staples), that hold together separate regions of the scaffold. In this paper, we modelize the 
annealing-melting properties of these DNA constructions. The model captures important features 
such as the hysteresis between melting and annealing, as well as the dependence upon the topology 
of the scaffold. We show that cooperativity between staples is critical to quantitatively explain the 
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1} folding process of DNA origamis. 
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I. INTRODUCTION 



DNA origamis [1] are formed in an annealing process where a long single stranded DNA 
(ssDNA) scaffold extracted from a virus (M13mpl8) is folded with the help of a set of 
~ 200 short ssDNA strands, called staples. The resulting structure has a typical size of 
lOOnm. Locally, an origami is made of double stranded DNA (dsDNA) helices, where one 
of the strands corresponds to the scaffold. Planar and 3D structures are obtained as the 
superposition of DNA helices, with basically only one constraint: the staples should run 
from one helix to the other in such a way that mechanical deformations are minimized. This 
seemingly simple process has opened the way to the fabrication of a variety of nanostructures 
in 2D or 3D, acting as biosensors [2] or nanorabots [3-5]. DNA origamis also provide with an 
easy tool to organize molecules such as proteins or carbon nanotubes [6] at the nanometer 
scale. 

However, despite these innovative realisations, the folding process of DNA origamis re- 
mains poorly understood. Open questions include the exact role of the staple connectivity, 
the differences in yield for very similar origamis [1], or the factors that can influence the 
global stability of these nanostructures. From an experimental point of view, the process 
of formation of a DNA origami can be analysed by collecting the fluorescence intensity of 
a reporter dye [7] or by monitoring the variation of its UV absorption as a function of the 
temperature (melting or annealing curve for, respectively, increasing or decreasing tempera- 
tures) [8]. In this paper we study in depth the process of formation of DNA origamis through 
the analysis of melting and annealing curves obtained from UV absorption experiments. 

The rest of the paper is organized as follows. Section 2 describes the formation of struc- 
tures made of three ssDNA (called 'small origamis' in the following). Those can be considered 
as the building blocks of the usual DNA origamis. In this section, we also give a short de- 
scription of the experimental techniques and the interpretation of UV absorption data. In 
sections 3 and 4, the model describing the energetic cost associated with the local connectiv- 
ity of origamis, and the modelling of the annealing-melting processes are presented. Section 
5 compares the melting curves of several DNA origamis with those predicted by the model. 
The final section presents some concluding remarks. A short report of this modelling [9] 
and a comparison between the model and experimental AFM data [10] have been published 
elsewhere. 



2 



II. THE BASIC FOLDING MECHANISM UNREVEALED BY SMALL 
ORIGAMIS. 



The basic mechanism in the formation of a DNA origami is the hybridization of a sta- 
ple to the scaffold. In this section we will consider very simple systems that help in the 
understanding of this basic process. In order to simplify the reasoning, let us first consider 
the situation where one staple (call it Si) only binds to two separate parts of the scaffold 
(in Rothemund origamis, staples usually bind to three different parts of the scaffold). The 
binding to the scaffold necessarily proceeds in two steps. First, one of the two parts of the 
staple binds to the correct region of the scaffold: the zipping mechanism that ensures this 
binding is stopped by the existence of a noncomplementary region in the scaffold. At this 
stage, half of the staple is bound to the scaffold, forming a stiff double helix. The remaining 
unbound parts of the scaffold and the staple form flexible coiled coil structures. Fig. 1 
illustrates the two possible situations that can be found. When the staple is in the 'outer 
position', the coiled coil structures are on the same side of the double helix. This is usually 
called a 'bulge' in the context of DNA thermodynamics [11]. On the other hand, when the 
staple is in the 'inner position', the coiled coil structures are on opposite sides of this double 
helix. Therefore, the probability that they merge is lower and the staple is expected to 
be much less stable in this position. In Rothemund's origamis, both situations (inner and 
outer) are found. Whereas the formation of a bulge (outer position) is well documented, 
it seems that the thermodynamics of the 'inner position' binding has not been considered 
previously. This is why we decided to study the small origami structures. 

Another important aspect in the folding of DNA origamis is the interaction between 
staples. Following the simple reasoning of the previous paragraph, let us consider the influ- 
ence of a second staple S2 on the binding of Si. If the melting temperature of S2 is higher 
than that of Si (T m (S 2 ) > T m (Si)), by the time Si starts binding, with high probability 
both parts of S2 are bound to the scaffold, reducing its effective elasticity and/or length. 
This could in turn facilitate the binding of the second part of Si, resulting in some kind of 
cooperative behaviour. 

We designed a DNA construction (called small origami) made of two ssDNA 32b long 
(staples) and a 64b long ssDNA (scaffold). This structure is similar to DAO structures[12] 
and comparable in size and shape to JX and PX structures which have already been studied 
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experimentally [13], and theoretically [14], [15]. Three different sets of staples based on 
the same structure were chosen to quantitatively evidence cooperative effects during the 
binding of the staples. In the first two sets, the two staples (Bl and B2) have very different 
compositions: the sequence of Bl only contains A or T nucleotide whereas B2 only contains 
G or C nucleotide. Accordingly, their melting temperatures are far apart, respectively 57 °C 
and 91 °C . This allows to differentiate the two staples in the melting curve. The third set 
has two staples, Blm and B2m, designed with chemical sequences different enough to avoid 
mispairing with the 64b template BO. They have close melting temperatures (respectively 
77 °C and 80 °C ) as their AT/GC ratio are similar. 

The topology of the binding is illustrated in (Fig. 1): each staple contains two contiguous 
parts, 32b long, that bind to the scaffold. In (Fig. la), Bl is in the 'outer' position, B2 
is in the 'inner' position. It should be noted that besides the existence of an entropic 
hindrance, the inner position requires that double-helical domains stay in close contact, 
which could result in additional instability. Motivated by the previous considerations, we 
have investigated three different cases: 

• B2 is located in the inner part, Bl in the outer part. 

• B2 is located in the outer part, Bl in the inner part. 

• Blm is located in the outer part, B2m in the inner part. 

For each of these three cases, we have performed UV absorption measurements as a 
function of temperature. These measurements are based on the fact that hybridized bases 
absorb less than open bases: this is the so-called hyperchromic effect. The absorbance 
associated to any staple Si is: 

AbssAT) = {Abso(Si) + Abso(Si)) (1 - h Si x Si (T)) ■ (1) 

In this equation, xs^T) denotes the proportion of staples that are folded (at temperature T), 
Abs (Si) is the absorbance of Si and Abs (Si) the absorbance of its complementary staple, 
calculated according to [16]. We used the following relation for the hypocromicity h Sl : 

h Si = 0.287(1 - f gc (Si)) + 0.15f gc (Si) (2) 
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where f gc (Si) is the fraction of GC content in the staple Si. This formula has been slightly 
modified from [16] to improve the fit against the experimental data for the small origamis. 
The degree of pairing 6{T) can be obtained from the raw absorbance measurements following 
the methods in [8]. 

In (Fig. 2a) we report the derivative of the melting curves 9(T) that show the behaviour 
of the Bl strand for different configurations. Fig. 2a. A corresponds to the melting curve of 
Bl with its complementary Bl: it shows a maximum peak at 57 °C and a half width of 
4.5 K. We analyse first the case where Bl is outer. Without the staple B2 (Fig. 2a.B), the 
structure produces a loop or bulge, as described in [11] that introduces an entropic penalty 
and decreases the melting temperature to 48.5 °C and a half width of 6 K. Thus the folding 
is much less robust. When Bl and B2 are both present two events appear on the melting 
curve. The first one at 83 °C (not shown in Fig. 2a) corresponding to the folding of B2 in the 
inner position. Then Fig. 2a. C, Bl folds at a temperature higher than when it is alone, with 
a maximum peak at 51.5 °C but with a similar half width. Therefore, the inner staple B2 
helps the pairing of the outer staple Bl by suppressing part of the entropic penalty related 
to the bulge formed by the scaffold. When Bl is in the inner part (Fig. 2a.D), its binding 
is significantly destabilized: the maximum peak is located at 41.5 °C , 15 K lower than the 
value of the dsDNA and with a half width as large as 12 K. This is three times larger than 
the one of B1B1. Again, when the B2 staple is added (Fig. 2a. E ), the pairing of Bl is 
stabilized with a maximum peak much higher located at 50 °C but still with a rather large 
half width of 8 K. These experimental results support the evidence of a strong correlation 
between the two strands: the presence of B2 helps the folding of Bl whatever its location. 
Moreover, these results also show that the location of the strand is of importance, the inner 
location being much less favourable. 

Because Bl folds at a much lower temperature than B2, the binding of B2 is not influenced 
by the presence of Bl. Fig. 2b illustrates the influence of the location (inner or outer) for 
B2. The double helix structure B2_B2 (Fig. 2b. A) has a maximum peak at 91°C with a half 
width of 3.5 K, while when B2 is in the inner position (Fig. 2b. B) the peak is located at 83 °C 
with a half width of 8 K. When B2 is in the outer position (Fig. 2b. C) the peak is located at 
86.5°C and is slightly narrower (half width of 6.5 K). Therefore, the same differences between 
the inner and outer positions are observed whatever the chemical sequence involved. 

The experimental results obtained with the Blm-B2m set of staples are shown in (Fig. 2c 
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and 2d). The same trends as in the B1-B2 case are observed but with less pronounced 
effects: the temperature shifts and the increase of the half width with respect to the melting 
curves of the double strands BlnxBlm and B2mB2m are smaller. A correlation effect is also 
noticeable, and is now observed for both strands Blm and B2m. When the staple Blm is in 
the solution, the staple B2m, which folds at a higher temperature, shows a narrower peak at 
a maximum 1.5 °C higher than when it is alone. Therefore partial folding of the staple Blm 
helps the folding of the staple B2m. Similarly to what was observed in the previous case, 
the correlation effect is even more effective when we consider the influence of the B2m staple 
on the folding process of the Blm staple (Fig. 2c. D) with a shift of the melting temperature 
from 67 °C to 72 °C. 

III. HYPOTHESIS FOR THE MODEL OF DNA ORIGAMI FOLDING. 

The previous section considered the folding of simple DNA constructions that go beyond 
the simplest double stranded DNA structure. A visual inspection of Rothemund DNA 
origamis shows that small origamis can be considered as their building blocks. Therefore, 
we expect that the same physical mechanisms (cooperativity and topological effects) that 
explain the formation of small origamis will be present in the formation of larger origamis. 
However, we first need to make a few hypothesis in order to make the problem tractable. 
The basic difficulty is related to the huge number of possible configurations that need to be 
handled to compute average properties such as the number of open base pairs. Long linear 
structures of double stranded DNA can be computed rigorously because recurrence relations 
can be established in such cases [18], [17]. DNA origamis are highly connected structures 
bearing pseudoknots. This prevents the use of linear recurrences. 

The DNA origami folding process can be generally described by a set of insertion reactions 
of the form 

Si + NW^NiSjSi, (3) 

where Si denotes the %— th staple, and N(Si) represents a particular set of neighbour staples 
(to be defined below) that influence the insertion of staple Si. Eq. 3 will be made more 
precise in the following. To proceed further, we need to make further approximations. 

Each staple Si of length \Si\ can be divided in parts that hybridize to non-contiguous 
regions of the scaffold. Let us note Si = S i:1 + S ij2 + • • • such a division of the strand sequence 
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(typically, each 32b staple is divided in three parts but other partitions are possible). Staples 
in the small origamis are divided in two parts: Si = S^i + S i;2 - 

Hypothesis 1: we will focus on configurations where each part S i:k is either completely 
hybridized to the scaffold or completely unbound. Moreover, we also disregard misfolded 
configurations, that is, staples that partially hybridize to the 'wrong' part of the scaffold. 
Notice that this assumption is plausible for the one layer origamis we consider here. For 
more complex, multi-layered structures, the staples are divided in smaller parts so that the 
probability to bind to the wrong part of the scaffold is considerably increased. We also 
disregard possible secondary structures of the scaffold. 

We will call crossover the connection between two contiguous parts of any staple. A 
crossover is not associated with a particular DNA base, it is only a convenient notation to 
describe the connectivity of the origami. In Rothemund's origamis [1], typical staples are 
32 bases long and composed of three parts (8,16,8 bases long respectively) linked by two 
crossovers cS it i and cS i)2 - On the scaffold side, a crossover is associated with a loop, a subset 
of the scaffold that is hybridized (or not) depending on the presence of other staples. In the 
previous example (Fig. la) Bl and B2 are composed of two parts connected by a crossover, 
and BO plays the role of the scaffold. 

Hypothesis 2: configurations with non contiguous hybridized parts are forbidden. This 
hypothesis is well verified when the central part of the staple is much longer than the other 
parts. In the following, we will note Si(k,l)(k < I) the configuration where S^, S it k+i, ■ ■ ., 
Sij are hybridized, the other parts being unpaired. 

The model aims to compute the probability p(Si(k,l),T) of having a particular folded 
state of the staple Si at temperature T. We will assume that at very high temperature 
T = Th, all the staples are unfolded: p(Si(k,l),Th) = (in practice, Th = 90°C ). The 
model is recursive: p(Si(k, l),T + dT) is computed based on the knowledge of p(Si(k, l),T). 
The increment dT can be positive or negative: the algorithm starts from T h , the temperature 
decreases down to a value 7], then increases again. At any temperature T, the probability 
to observe a given configuration Si(k, I) will depend upon the presence (or not) of neighbour 
staples. Therefore, for each staple a set of possible neighbour staples {Ni(Si), A^S^), . . .} 
needs to be defined. In the following, we will use the generic notation N a (Si) for any of 
these neighborhoods. 

How many staples one has to consider in each of these sets is a parameter of the model. 
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In this paper, all the origamis are such that the crossovers can be aligned in rows. In the 
following, for each staple and crossover, the set of neighbour staples will be limited to those 
that have crossovers in the same row, and are separated by less than 75b. Let us note that 
all the results on this paper are robust against variations of this parameter. With these 
notations, the probability to observe the staple Si in a given configuration Si(k,l) and for a 
given neighbourhood Nj(Si) is modelized by an equilibrium reaction: 

Si(k, I) + N a (S t ) ^ N a (Si)Si(k, I). (4) 

This modelling therefore does not consider any kinetic effect. Notice however that the 
equilibrium hypothesis is a local one. Indeed, we expect that the model will reflect the 
hysteresis observed in the annealing-melting of DNA origamis. 

Hypothesis 3: because the model only keeps track of the single probabilities 
p(Si(k,l),T) and not of the joint probabilities p(Si(k,l), S 2 (k f ,l'), . . . T), it is necessary to 
make an additional approximation to determine p(N a (Si),T). Again, based on the data 
from the small origamis, we assume that there is a strong correlation between the different 
staples. As the processes of annealing and melting are monotonous, for two staples and 
S i2 we make the hypothesis that if p(S il (k,l),T) < p(S i2 (k f ,l'),T), then the S i2 staple is 
present in the structure when S^ starts to fold. In order to compute p(N a (Si),T), let us 
generalize this idea and order the staples in N a (Si) = {S^, S i2 , . . .} in such a way that 
p(S il ,T) < p(S i2 ,T) < . . . < p(S c ) = 1, where S c stands for the scaffold. According to the 
high correlation hypothesis, we approximate the joint probabilities in the following way: 

p(S h ,T) = p(S H ,S l2 ,...) 
p(S i2 ,T) - p(S il ,T) = p(S i2 ,S i3 ,...),... 

For instance, in the case where only two crossovers influence Si, with probability p(<S'i 1 ) 
both cS^ and cS i2 are present, the probability of only having cS i2 is p(S i2 ) —p(S il ) and the 
probability of only having the scaffold is 1 — p(S i2 ). 

The probability p(Si,T) for staple Si to be hybridized at temperature T is the solution 
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of the coupled set of equations: 



p(Si,T)= p(Si,T\N a (Si))p(N a (Si),T). 



(5) 



N a (Si) 



This set of equations can be further approximated by the iteration: 



p(Si,T)= p(Si,T\N a (Si))p(N a (Si),T-dT). 



(6) 



N a (Si) 



Here, dT denotes the temperature step that determines the annealing-melting protocol 
and p(Si,T\N a (Si)) the conditional probability to hybridize staple Si in the neighborhood 
N a (Si). In the annealing process (dT < 0), at high temperature Too, p(N a (Si,T = Too)) = 
excepted for the empty neighborhood (the staple hybridizes to the scaffold in the absence of 
any other staple). For lower temperatures, p(N a (Si),T) can be computed from the knowl- 
edge of p(Si,T) and the correlations between staples (hypothesis 3). We show in appendix 
A that the set of coupled equations 5 determines p(Si(k,l),T) provided the equilibrium 
constants of the reactions 4 are known. This amounts to define an energy model which is 
detailed in the next section. 

IV. MODELLING THE ENERGY OF DNA ORIGAMIS 

In this section, we introduce an energy model for the hybridization of staples to 
the scaffold of a DNA origami. The gain in Gibbs free energy for hybridizing Si(k,l), 
AG(Si(k,l),T) = AH(Si(k,l),T) -TAS(Si(k,l),T) contains two contributions: 



The local contribution AGnn only depends on the sequence of Si(k, I). It quantifies the gain 
in free energy associated with the local formation of double helices. We use the parameters of 
the nearest-neighbour model [11] with a temperature correction given in [19] (see appendix 
B and [19] for a detailed description). 

AG t0 p gathers several contributions that depend on the connectivity of the origami 
(AGnn depends mostly on the sequence of the scaffold and, to a less extent, on the density 



AG = AG NN + AG, 



top- 



(7) 
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of crossovers, but not on the connectivity). With any crossover, we associate an entropic 
penalty. This penalty reflects the difficulty for a staple to hybridize non- contiguous parts of 
the scaffold. In a first approximation, the longer the region of the scaffold that connects the 
two parts to be hybridized, the larger the penalty. Our previous results obtained with the 
small origami show that this needs to be refined. Based on these data, we consider three 
situations characterized by transient arrangements of staples that we call local intermediate 
states (LIS). In the first one (LIS1), the staple hybridizes to the scaffold, forming an internal 
asymmetric loop [11], Fig. 3a. The length of this loop corresponds to the number of unpaired 
bases of the scaffold linked by the crossover. This is a generalization of the 'outer' position 
found for the three strands origami. Before the crossover formation, when only part of the 
staple is folded, the scaffold and the non hybridized part of the staple are on the same side 
of the hybridized part of the staple (Fig. lb). In this case, the staple is not involved in the 
path that connects the two extremities of the crossover. A particular case of LIS1, which we 
call LIS2, is the situation where the length of the loop is zero: the crossover forms locally a 
Holliday junction, the staple hybridizes in the close vicinity of an already hybridized staple 
(Fig. 3b). 

The third LIS, LIS3, corresponds to the inner position in the small origami: the shortest 
path that connects the two ends of the crossover involves the staple itself (Fig. 3c). Because 
of this, before the crossover forms, the non hybridized parts of the strand and the scaffold 
are located on opposite sides of the hybridized parts (Fig. lc). Therefore, LIS3 implies a 
larger penalty than LIS1 or LIS2. In the small origami, the shift in T m was less than 10 °C 
for LIS1, between 5 °C and 7 °C for LIS2 and up to 15 °C for LIS3. 

To each of these LIS is associated a different AG top '- 

• AG top (LISl)= -TAS bu i ge (n T - 0.8 < nb fo i ded >). The function AS bu i ge (n T ) is that of 
ref. [11]. Tlx corresponds to the number of bases along the scaffold and < nbfoided > is 
the average value of bases folded along the scaffold (this average takes into account the 
probabilities of all the possible neighbouring configurations). The comparison between 
this model and the experimental results from the small origami structure is illustrated 
in Fig. 4. 

• AG top (LIS2) = AH loop0 - TAS loop0 with AH loop0 = 25.3kcal/mol and ASi oop0 = 
65.0cal/mol/K. This constant contribution has been derived so as to fit as well as 
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possible the B1-B2 experimental data (left of Fig. 5) and then applied to the Blm-B2m 
data (right part of Fig. 5). 

At the crossover, two bases that belong to 5, face the bases constituting the crossover 
made by the other strand. The initial enthalpic and entropic contribution of this pair 
of bases is subtracted from AG^n as they are not nearest-neighbours anymore. One 
half of the contribution (nearest-neighbour model) of the two new pair of bases is 
added. 

• AG top (LIS3) = AH unbind - TAS unbind - TAS hulge {n t - 0.8 < nb folded >) (Fig. 3c): an 
additional penalty is added to the entropic penalty of the loop. This intends to reflect 
the stronger instability characteristic of this LIS. AH un u nd (resp AS un bind) quantifies 
the loss of enthalpy (resp entropy) associated with the partial unfolding of the ends of 
an staple involved in this type of LIS. The number of bases that unfold is a parameter 
of the model. The data in Fig. 6 correspond to the unfolding of a total of 8 bases (two 
bases for each of the four extremities of the staple, see Fig. 3c). 

Under some circumstances (Fig. 3d), two types of LIS can be attributed to a given 
crossover. In such cases, the LIS with the smaller AG top is taken into account. 

The modelling obtained with the contributions AGnn and AG top is quite satisfactory 
except for a constant negative shift (~ — 4K) of the melting temperatures. This shift indi- 
cates that another stabilizing mechanism that is not present in small constructions such as 
the small origamis has to be invoked. Indeed, in the folded structure of origamis, double- 
helix sections are separated by distances of the order of lnm. It is then reasonable to think 
that mechanisms such as correlations between counter-ions and hydration forces come also 
into play, as is the case when DNA condensates [20]. This electrostatic stabilizing term is 
proportional to the number of neighbouring bases n n that are close to the staple Si, and to 
the length \Si(k,l)\ of the partial configuration Si(k,l) considered. 

AH n (Si(k,l)) = -0.132n n \Si(k,l)\/\Si\(kcal/mol) (8) 

The energy per base —0.132kcal/mol/base is similar to the one needed for DNA condensation 
lO^ksT/base to 10~ 2 kBT/base [21]. An alternative explanation for the presence of this 
term would be the entropy reduction due to the confinement of the double helices in a 2D 
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structure. 

The probability to fold any part Sij of staple Si is given by the sum of the probabilites 
of any configuration Si(k,l) that contains Sij: 

P (Sij,T)= Yl p(ss>p)>n ( 9 ) 
i,p,i<j<p 

The fraction of folded bases for staple Si 

^(t) = 4 t Ep( 5 «' t )I 5 «I ( 1Q ) 
\z>i\ j 

can be then converted to a theoretical absorbance [16] as in the case of small origamis 1. 

V. COMPARISON WITH THE ANNEALING-MELTING DATA OF DNA 
ORIGAMIS 

We considered four different DNA origamis (Fig. 7) with the same scaffold (M13mpl8 
virus) and about 200 staples. 01 is the rectangle in the original Rothemund work [1], the 
staples are mostly 32b long, with folding part sequences divided in 8-16-8 patterns. 02 is 
another rectangular origami that includes a hole [23] and presents the same 8-16-8 pattern. 
03 has the same connectivity pattern as 01, but some staples have been merged two by 
two in four areas (coloured in black in 7c), so that the typical staple pattern is 8-16-16-16-8. 
Finally, 04 is another rectangular origami where a 100b long subset of the scaffold goes from 
one side to the other of the rectangle, forming a ssDNA 'bridge'. 

For each origami, a series of annealing-melting cycles was performed coupled to UV- 
absorption measurements. With the methods of [8], the degree of pairing 9(T) was extracted 
as a function of temperature. The temperature ramp (0.4 °C min -1 ) is typical for this one- 
layer origamis. The annealing-melting process is not symmetrical, the hysteresis between 
the two phases of a cycle is such that the melting takes place at temperatures higher than 
the annealing. 

There is an overall agreement (Fig. 7) between the melting-annealing curves observed 
experimentally and computed with the model. The model captures the hysteresis between 
the annealing and melting processes, as well as the relative strength of this hysteresis between 
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different origamis (02 has only 4K shift between annealing-melting, whereas 04 has 10K 
shift). The maximum value of the derivative, which can be linked to the overall enthalpy of 
the transition in a two-state model, is also reproduced. This feature is robust against small 
variations of the parameters of the model. 

A. Understanding DNA Origami design 

In this section, we will rely on the model developed in the previous section to explore how 
the melting temperature of the rectangular origamis depends on the specific connectivity. 
This type of considerations could be relevant in applications where it is necessary to improve 
the stability of the template against temperature (DNA origamis as platforms controlling 
chemical reactions or other applications including grafting inorganic species). 

1. Circular permutation of the scaffold. 

The scaffold used for the design of rectangular origamis is a circular phage. Thus, it is 
possible to choose the beginning of the scaffold sequence anywhere so that 7248 different sets 
of staples (with same length and position, but with a different composition) are possible. 
We compared the melting curves given by those permutations on the 04 origami, permuting 
the sequence in steps of 16 bases as this shifts the middle of one staple to the middle of 
the nearby staple. The distribution of temperatures corresponding to the maximum of the 
derivative for the annealing and melting curves (Fig. 8a) shows an amplitude of variation 
of about 4 °C . Depending on the permutation, the melting curves can be very different 
in shape (Fig. 8b). We conclude that, for a given scaffold, there is some sort of invariance 
regarding the choice of the origin. 

2. Decreasing the number of crossovers. 

In our model, a penalty is associated to each crossover. Reducing the number of crossovers 
should in principle increase the stability in the annealing process. We started from the 01 
shape to reduce the number of crossovers, as its design is more regular. In the initial origami 
there is a length of 32 bases between two crossovers, which corresponds to three periods in 
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the double helix. Increasing the distance between crossovers leads to consider 54 bases (5 
double- helix periods). We considered two possibilities, illustrated in Fig. 9: staples 27b long, 
split in 13-14 (06 origami), and staples 54b long, split in 13-27-14 (05 origami). Indeed, 
there is a trade-off between the gain in enthalpy when increasing the length of the staple, 
and the additional penalty of having two crossovers/ staple instead of only one. Our model 
shows that the net gain in stability, compared to the initial 8-16-8 staple strategy, is almost 
20 °C (Fig. 9 05). Again, the comparison with the experiments is excellent. Notice that 
decreasing the number of crossovers could have an impact on the flexibility of the origami. 

VI. CONCLUSION 

DNA origamis appear as a versatile tool to design various types of DNA based nanostruc- 
tures. We have introduced a simple algorithm based on known thermodynamic properties 
of dsDNA and their parameterization with the NN-model [11]. The energy model is based 
on a preliminary study on small origamis that points to the importance of crossovers and 
their associated entropic penalty and correlation effects. This algorithm provides a reason- 
able account of the observed melting and annealing behaviour of DNA origamis. The model 
reproduces hysteresis and melting temperatures, as well as the width of the melting curve. 
The hypothesis of local equilibrium turns out to be compatible with the experimentally 
observed hysteresis. The model emphasizes the role of cooperativity in the folding process 
by introducing correlations between the probability of presence of neighbour staples. This 
shows that the folding of each staple is strongly dependent on the existence of a nearby 
cluster of folded staples. An intuitive picture can be derived from this modelling. In the 
annealing process, only the staples with a high GC content are able to fold at high temper- 
ature. These staples provide with initial nuclei that facilitate the folding of the neighbour 
staples. Therefore, the folding of staples with high AT content will take place at a tempera- 
ture which is dependent on the distance to the set of these initial nuclei. During the melting 
process, the unfolding of any staple is mostly independent on its neighborhood and it is well 
accounted by an average penalty in free energy. The histeresis is then explained by the fact 
that the neighborhood of each staple is different in the annealing or melting. 

As a possible application, we have shown that the model allows to improve the thermal 
stability by quantifying the effect of different construction factors such as staple length and 
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density of crossovers. Extensions to 3D [24] [25] and structures other than origami [26] 
are envisioned, as well as tests at the single molecule level (FRET). As shown in [10], the 
interaction with the depositing surface will be crucial to interpret future AFM measurements 
on the folding of DNA origamis. 
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Appendix A: Computing the probabilities from the law of mass action. 

In this section, we explain how to compute the probability p(Si(k, l),T) by solving a set 
of coupled equations which reflect the assumption that the reaction 



is governed by the law of mass action. For the sake of clarity, we consider first the particular 
case illustrated in Fig. 10: a staple divided in two parts of equal length is inserted in the 
vicinity of another staple that holds together a portion of the scaffold. For this particular 
example, we nees to consider three equilibriums: the simultaneous binding of the two parts 
of Si, with equilibrium constant: 



Siik^ + N^^N^SiikJ) 



K. 




( 



AG NN (Sj(l,2),T) 
kT 



exp(-AS bulge (2L))/k 



(Al) 



The binding of only one half S^l, 1) of the staple, with equilibrium constant: 




(A2) 



The binding of the other half Si(2, 2) of the staple: 




(A3) 
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More generally, for any set of equilibriums between Si and its neighborhood N a (Si), the 
law of mass action reads: 



msMSi] ~ S ^ p) ~ p V kf ) (A4) 

The concentration of free staples in solution is given by 

C (exs-J2 E P(Si(u,v)\Np(Si))p(N p (Si)) 

\ fS u,v,u<v / 

Exs is the excess of staple concentration, compared to the initial concentration of scaffold 
Co, p{Si{u,v)\Np{Si)) the conditional probability to observe Si(u,v) given the neighbour 
staples Np(Si) and p(Np(Si)) the probability of the neighborhood Np. In order to get a 
closed set of equations, we approximate p(Np(Si),T) ~ p(A r g(S , j),T — dT) where dT is the 
temperature step in the melting-annealing process. 

As the excess in DNA origami annealing experiments is important, the concentration of 
free staples will vary weakly in the process of formation. It is then approximated by CqExs. 
The concentration of free N a (Si not bounded) is given by: 

C (p(N a (Si))- p(Si(u,v)\N a (Si))p(N a (Si)) 

\ U,V,U<V / 

The concentration of the configuration Si(l,p) hybridized in the neighborhood N a (Si) is 

C p(Si(l,p)\N a (Si))p(N a (Si)). 
The approximated expression of the equilibrium constant is therefore: 



p(^(/,p)|jVq(^))p(iV a (^)) 

rsj 

ExsCq (p(N a (Si)) - p(S l (u,v)\N a (S t ))p(N a (S l )) 

\ u,v,u<v 



(A5) 



This set of equations can be solved by noticing that the denominator in A5 is independent 
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of Si(l,p). Thus, for any couple of configurations (Si(l,p), Si(u,v)), 



p(SS, P )\N a (S t )) _ 



Let y a = ^ p(Si(l,p)\N a (Si)) and S a = ^ ^siipf- Then, by summing over / and p eq 



A5 reduces to: 



C Exs l-y a 
or: 



' S a (A7) 



C EXSS a 

y ° " 1 + C Sxa5 a [ } 



and finally 



p(S t (l,p),T) = J2p(Si(l,p)\N a )p(Na(Si),T) ~ Yj>(N«(Si),T-dT)y a -^ (A9) 
Appendix B: Temperature and salt corrections 

The computation of the nearest-neighbour contribution AG^n — AH^n — TASnn in- 
cludes corrections to take into account temperature and salt variations. 

1. Temperature corrections 

Besides the nearest-neighbour contributions (AH^, AS%) [11], we have also included 
temperature dependent corrections: 

AH NN (Si(l,p)) = AH° N (S t (l,p)) + C p \l-p+l\(T-T ref ) 
AS NN (Si(l,p)) = AS° N (Si(l,p)) + C p \l-p+l\hv(7^) 

J- ref 

where C p = —A2cal / mol / K /bases and T re f = 53° C according to [19]. 
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2. Salt corrections 



The parameters of the NN model [11] are given for standard salt concentrations ([Na]=lM, 
[Mg]=0). Different salt conditions can be taken into account using the correcting terms in 
[22]. These corrections apply when Mg is dominant. We assume that for each attachment 
of the staple Si on the configuration Si(l,p) (of length \Si(l,p)\), the relation 

/(M^7Va,/ GC ,|^,p)|) 



T m (Mg, Na, C, S l (l, p)) T m (0, Na, C, $(Z, p)) 

between the melting temperatures T m at different concentrations of Mg and Na holds (Jgc 
is the fraction of GC content of Si(k,l)). 

The melting temperature corresponding to the point where AG(T m ) = 0: 

T m (0,Na,C,S t (l, P )) = A c ^fffTT^ (Bl) 

AS NN (Si{l,p)) + Rln{C/A) 

it can be deduced that the salt corrections are taken into account by an entropic correction 
given by: 

SC(Mg,Na) = f(Mg, Na, / GC , |^(Z,p)l)Atf (B2) 

where AH is the sum of all the contributions previously cited (nearest-neighbour, tem- 
perature corrections, topological contributions). In this paper, we modified the function 
f(Mg, Na, Jgc, \Si(l,p)\) as given in [22] by a small additive term to take into account low 
AT content strands. In order to do this, we compared the predictions of [22] with the exper- 
imental results obtained with the small origamis. In [22], the authors calibrated their model 
against 17 different dsDNA, involving a wide range of salt concentrations, and obtained a 
mean deviation —1.7 ±0.7 °C . We deal here with a more restricted range of salt concentra- 
tions, it is therefore expected that the model [22] can be improved. We used a correction to 
the / function of [22] for the low fractions fee of GC content, to obtain a mean deviation 
of —0.4 ± 0.3 °C . The corrected / function reads: 

fne W = f+ {fgc ~ 0.5)0.00008 if f gc < 0.5. 

The results obtained with this new expression for the strands involved in the differents 
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small origamis are illustrated in Fig. 11. 
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Figure 1: (a) Schematic representation of the connectivity of the small origami, (b) and (c) Transi- 
tory situation correponding to the binding of half of the staple to the scaffold, (b) Bl staple is in 
the 'outer' position, (c) B2 staple in the 'inner' position, (b) and (c) show that the partial binding 
of staples in the outer (b) or inner (c) positions are very different. 
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Figure 2: The derivative d9/dT reported in the four figures corresponds to the folding of the dotted 
staple, a) experimental data on the folding of Bl(AT) cases (A,B,D) in the absence of B2(GC), 
cases (C,E) with B2 already folded; b) experimental data on B2 without Bl; c) experimental data 
on Blm; d) experimental data on B2m 
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Figure 3: representation of the three different local intermediate states (LIS). The staple to be 
inserted is represented by the dotted line, the scaffold by the continuous line, (a) LIS1 corresponds 
to the formation of a bulge, (b) LIS2 corresponds to the case where, due to the binding of other 
staples, the effective length of the bulge is zero, (c) LIS3 generalizes the 'inner position' of the small 
origamis. We assume that, because of the curvature constraints imposed by this configuration, the 
staple is not able to fold completely, (d) A typical situation where two types of LIS (LIS1 at the 
right side of the staple, LIS3 at the left side) can be attributed to a given crossover. 
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Figure 4: Melting curves obtained for a small origami containing only one staple in the outer 
position (inset). Comparison between the experimental (dotted line) derivative dO/dT and the 
modeling (continuous line) as provided by the contribution AGf op (LISl). In order of increasing 
melting temperature: Bl staple (blue), Blm staple (green) and B2 staple (red). 
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Figure 5: Melting curves obtained for a small origami containing two staples (configuration rep- 
resented in the inset). Comparison between the experimental (dotted line) derivative d6/dT and 
the modeling (continuous line) as provided by the contribution AGto P (LIS2). The curves clustered 
around T m ~ 50 °C correspond to Bl in position outer (cyan),Bl in position inner (magenta), 
the modelling does not make any difference between these two cases. The curves clustered around 
T m ~ 70 °C correspond to the simultaneous binding of Blm (inner) and B2m (outer). 
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Figure 6: Melting curves obtained for a small origami containing only one staple in the inner 
position (inset). Comparison between the experimental (dotted line) derivative dO/dT and the 
modeling (continuous line) as provided by the contribution AGi 0p (LIS3). In order of increasing 
melting temperature: Bl staple (blue), B2m staple (black) ,B2 staple (red). 
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Figure 7: Derivative of the pairing degree versus temperature for the 01, 02, 03 and 04 origamis 
represented in the insets. The data corresponding to annealing are in red, melting in green, the 
model (dashed line) is in blue for both processes. 
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Figure 8: (a) Distribution of melting temperatures (annealing and melting) as a function of the 
order of the circular permutation of the scaffold strand, (b) two different melting curves corre- 
sponding to two permutations with the lowest and highest annealing temperatures. 



24 



0.18 
0.16 
0.14 
0.12 

rO.10 



8-16-8 



13-14 




13-27-14 




8-16-8 



13-14 



13-27-14 



Figure 9: Annealing curves of the 01, 05 and 06 origamis. The three origamis correspond to the 
same scaffold pattern, but different staple pattern (solid line = experimental data, dashed line = 
theoretical curves). The lower panel represents a detailed view of the differences in connectivity 
between the three origamis. 
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Figure 10: Different possibilities for the staple Si to bind in the neighbourhood N a (Si) 
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Figure 11: (dotted line = experiment, solid line = theoretical) Comparison between the model and 
the melting experiments for four different dsDNA 
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